Method and system for creating a fragmented video recording of events on a screen using serverless computing

ABSTRACT

A method for creating a fragmented recording of events on a screen of a computer may receiving a start recording request to record screen events. The screen events may be captured and stored into video fragment files in a queue in a storage using video fragment settings. The video fragment files in the queue may be sent over a communication network to a server. The video fragment settings may be dynamically changed upon detecting a backlog in the queue of the video fragment files. In response to receiving a stop recording request from the server, a notification may be sent to the server that all of video fragment files in the queue were relayed to the server over the communication network for the server to merge the video fragment files into a single video recording file.

FIELD OF THE INVENTION

The present invention relates to video recording, and more specifically to a method and system for creating a fragmented video recording of events on a screen using serverless computing.

BACKGROUND OF THE INVENTION

A company may use customer call centers to provide customer service for their products and/or services. The customer experience in interacting with agents at the customer service center may depend, for example, on the efficiency of the agent to solve the customer's problem in a single interaction between the agent and the customer.

To track an agent's efficiency in handling the customer's call and solving the customer's problem, the company may monitor and/or record customer calls of the agents for quality control. The desktop or workstation of each agent may include multiple monitors, displays, or screens. The screens may be used to provide different information to the agent simultaneously in real time, such as for example, information regarding the customer, the product and/or services in question, and/or any other information needed by the agent to solve the customer's problem regarding the company's product and/or services.

Quality control personnel and/or the agent's managers, for example, may monitor the agent's customer calls for assessing agent performance and/or agent efficiency by accessing a video recording made of events on screen of the agent's terminal or desktop during a customer call, for example. Typically, the information presented and/or accessed by the agent on the screen during each customer call may be recorded into a single video recording along with the audio and/or video portion of the conversation between the customer and the customer service agent.

In that quality control personnel and/or the agent's supervisors may be at different geographical locations, cloud computing services may be used to relay the agent's screen events to the quality control personnel and/or the agent's supervisors at the remote location to be recorded in real time. This may induce large time delays between the time that the supervisor sends a request to the agent's terminal to relay the screen events and the time that the supervisor actually views the streaming video of the screen events on the agent's terminal in real time, which may be recorded on the supervisor's workstation, for example.

Current solutions to provide real-time streaming overcome a wide variety of firewalls and Network Address Translation (NAT)/Port Address Translation (PAT) restrictions/configurational issues. These solutions implement complicated signaling schemes such as web sockets, Web Real-Time Communication (WebRTC), Session Traversal Utilities for NAT (STUN), and Traversal Using Relay NAT (TURN) to overcome NAT restrictions.

Furthermore in using these current solutions, real time session establishment of the communication between the agent and supervisor terminal, for example, has a large time overhead or delay time (e.g., 3-4 sec), which may result in a loss of the video at the beginning of the agent's call. These real time solutions may be also sensitive to network stability such as temporary network outages/disconnections and/or bandwidth for insuring video quality. A bad user experience may occur with seek operations due to sizing issues (fixed 3000 frames) of a group of pictures, or GOP structure, which specifies the order in which intra- and inter-video frames are arranged.

Thus, there may be a need for a real time, cloud-based video recording of events on agent's desktop screens that overcome these limitations of real-time streaming solutions.

SUMMARY OF THE INVENTION

There is thus provided, in accordance with some embodiments of the present invention, a computerized method for creating a fragmented recording of events on a screen of a computer. The method may include, in a processor of a computer configured to execute code for recording events on a screen of the computer, receiving a start recording request to record the screen events from a server communicating with the computer over a communication network. The events on the screen may be captured. The captured screen events may be stored in a storage of the computer into a plurality of video fragment files in a queue in the storage using video fragment settings. The plurality of video fragment files in the queue may be sent over the communication network to the server. The video fragment settings may be dynamically changed upon detecting a backlog in the queue of the plurality of video fragment files in the storage. In response to receiving a stop recording request from the server, a notification may be sent to the server that all of the plurality of video fragment files in the queue were relayed to the server over the communication network for the server to merge the plurality of video fragment files into a single video recording file.

Furthermore, in accordance with some embodiments of the present invention, sending the plurality of video fragments may include uploading the plurality of video fragment files to a distributed cloud storage.

Furthermore, in accordance with some embodiments of the present invention, detecting the backlog in the queue may include detecting that a number of the plurality of video fragment files exceeds a predefined number.

Furthermore, in accordance with some embodiments of the present invention, dynamically changing the video fragment settings may include changing a group of pictures (GOP) size in the plurality of video fragment files for managing the backlog.

Furthermore, in accordance with some embodiments of the present invention, dynamically changing the video fragment settings may include changing a frame rate of the plurality of video fragment files for managing the backlog.

Furthermore, in accordance with some embodiments of the present invention, dynamically changing the video fragment settings may include changing a frame duration of the plurality of video fragment files for managing the backlog.

Furthermore, in accordance with some embodiments of the present invention, the captured screen events may be selected from the group consisting of: audio signals captured by an input device, audio signals captured through an output device, videos captured on the screen, and images captured on the screen.

Furthermore, in accordance with some embodiments of the present invention, the communication network may use a Hypertext Transfer Protocol Secure (HTTPS) protocol for sending the plurality of video fragment files to the server.

There is further provided, in accordance with some embodiments of the present invention, a client computer for creating a fragmented recording of events on a screen of the computer. The client computer may include a storage, a screen, and a processor. The processor may be configured to execute code for recording events on the screen, to receive a start recording request to record the screen events from a server communicating with the computer over a communication network, to capturing the events on the screen, to store the captured screen events into a plurality of video fragment files in a queue in the storage using video fragment settings, to send the plurality of video fragment files in the queue over the communication network to the server, to dynamically change the video fragment settings upon detecting a backlog in the queue of the plurality of video fragment files in the storage, and in response to receiving a stop recording request from the server, to send a notification to the server that all of the plurality of video fragment files in the queue were relayed to the server over the communication network for the server to merge the plurality of video fragment files into a single video recording file.

Furthermore, in accordance with some embodiments of the present invention, the storage may include a local disk on the client computer.

Furthermore, in accordance with some embodiments of the present invention, the code for recording the events on the screen may include a screen agent client software.

Furthermore, in accordance with some embodiments of the present invention, the plurality of video fragment files may include a plurality of standard containers.

Furthermore, in accordance with some embodiments of the present invention, the processor may be configured to detect the backlog in the queue by detecting that a number of the plurality of video fragment files exceeds a predefined number.

Furthermore, in accordance with some embodiments of the present invention, the processor may be configured to dynamically change the video fragment settings by changing a group of pictures (GOP) size in the plurality of video fragment files for managing the backlog.

Furthermore, in accordance with some embodiments of the present invention, the processor may be configured to dynamically change the video fragment settings by changing a frame rate of the plurality of video fragment files for managing the backlog.

Furthermore, in accordance with some embodiments of the present invention, the processor may be configured to dynamically change the video fragment settings by changing a frame duration of the plurality of video fragment files for managing the backlog.

Furthermore, in accordance with some embodiments of the present invention, the captured screen events may be selected from the group consisting of: audio signals captured by an input device, audio signals captured through an output device, videos captured on the screen, and images captured on the screen.

Furthermore, in accordance with some embodiments of the present invention, the communication network may use a Hypertext Transfer Protocol Secure (HTTPS) protocol to send the plurality of video fragment files to the server.

There is further provided, in accordance with some embodiments of the present invention, a computerized method for managing a fragmented recording of events on a screen of a client computer in a computerized system by a server. The method may include in a processor of a server communicating with a client computer over a communication network, sending by the processor to the client computer a start recording request to record events on a screen of the client computer. A plurality of video fragment files may be received by the processor for storage in a location of a storage in the server, where each of the video fragment files include screen events captured on the screen of the client computer. A stop recording request may be sent by the processor to the client computer. A notification may be receive by the processor that all of the plurality of video fragment files were relayed from the client computer over the communication network. In response to the received notification, the plurality of video fragment files may be merged by the processor into a single video recording file.

Furthermore, in accordance with some embodiments of the present invention, the processor may execute a serverless computing service for managing the plurality of video fragment files to create the single video recording file.

BRIEF DESCRIPTION OF THE DRAWINGS

In order for the present invention, to be better understood and for its practical applications to be appreciated, the following Figures are provided and referenced hereafter. It should be noted that the Figures are given as examples only and in no way limit the scope of the invention. Like components are denoted by like reference numerals.

FIG. 1 schematically illustrates a block diagram of a system for creating a fragmented video recording of a screen using a cloud-based serverless architecture, in accordance with some embodiments of the present invention;

FIG. 2 schematically illustrates a screen agent registration to a cloud computing service (CCS), in accordance with some embodiments of the present invention;

FIG. 3 schematically illustrates a flow diagram to start recording screen events on a hosted client, in accordance with some embodiments of the present invention;

FIG. 4 schematically illustrates a flow diagram to relay a recording request to a hosted client using signaling services, in accordance with some embodiments of the present invention;

FIG. 5 schematically illustrates video fragment files in a queue, in accordance with some embodiments of the present invention;

FIG. 6 schematically illustrates a flow diagram for uploading video fragment files in a queue to a screen recording bucket, in accordance with some embodiments of the present invention;

FIG. 7 schematically illustrates a flow diagram for issuing a stop recording request to a hosted client at a client computer, in accordance with some embodiments of the present invention;

FIG. 8 schematically illustrates a flow diagram for uploading an end notification to a Simple Queue Service (SQS), in accordance with some embodiments of the present invention;

FIG. 9 schematically illustrates a flow diagram for a cloud computing application to issue a POP message to signaling services, in accordance with some embodiments of the present invention;

FIG. 10 schematically illustrates a flow diagram for merging video fragment files, in accordance with some embodiments of the present invention;

FIG. 11 schematically illustrates a flow diagram for storing a single recording file created by merging video fragment files in a cloud storage service, in accordance with some embodiments of the present invention;

FIG. 12 schematically illustrates an architecture for managing a fragmented recording of events on a screen of a client computer by a server, in accordance with some embodiments of the present invention;

FIG. 13 is a flowchart illustrating a method for managing a backlog of video fragment files in a queue, in accordance with some embodiments of the present invention;

FIG. 14 is a graph of merge time versus a number of video fragment files, in accordance with some embodiments of the present invention;

FIG. 15 is a flowchart of a method for creating a fragmented recording of events on a screen of a computer, in accordance with some embodiments of the present invention; and

FIG. 16 is a flowchart of a method for managing a fragmented recording of events on a screen of a client computer by a server, in accordance with some embodiments of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be understood by those of ordinary skill in the art that the invention may be practiced without these specific details. In other instances, well-known methods, procedures, components, modules, units and/or circuits have not been described in detail so as not to obscure the invention.

Although embodiments of the invention are not limited in this regard, discussions utilizing terms such as, for example, “processing,” “computing,” “calculating,” “determining,” “establishing”, “analyzing”, “checking”, or the like, may refer to operation(s) and/or process(es) of a computer, a computing platform, a computing system, or other electronic computing device, that manipulates and/or transforms data represented as physical (e.g., electronic) quantities within the computer's registers and/or memories into other data similarly represented as physical quantities within the computer's registers and/or memories or other information non-transitory storage medium (e.g., a memory) that may store instructions to perform operations and/or processes. Although embodiments of the invention are not limited in this regard, the terms “plurality” and “a plurality” as used herein may include, for example, “multiple” or “two or more”. The terms “plurality” or “a plurality” may be used throughout the specification to describe two or more components, devices, elements, units, parameters, or the like. Unless explicitly stated, the method embodiments described herein are not constrained to a particular order or sequence. Additionally, some of the described method embodiments or elements thereof can occur or be performed simultaneously, at the same point in time, or concurrently. Unless otherwise indicated, use of the conjunction “or” as used herein is to be understood as inclusive (any or all of the stated options).

Embodiments of the present invention herein provide a method and system for creating a fragmented video recording of events on a screen of a client computer of a call center agent, for example, in a customer call center. A supervisor or quality control personnel at a remote location may send a request via a communication network to the client computer to start recording events on the computer screen. A screen agent application operating on the agent's client computer may start to capture screen events on the client computer.

In the context of the present disclosure, the term “screen events” may refer to audio signals captured by an input device at the agent's terminal, such as the agent's voice captured with a microphone, audio signals captured by an output device, such as the customer voice heard by the agent in a headphone or external speaker while interacting with the agent, videos on the screen that the agent is viewing, and/or still images on the screen that the agent is viewing. The agent may be interacting with another person, such as a customer, via a regular phone call and/or video call agents and/or applications operating on the agent's computer (e.g., client computer). These captured interactions may also be screen events in the context of the present disclosure.

In some embodiments of the present invention, the client computer may be configured to record the captured screen events to small media files, which also may be referred to herein as video fragment files. The client computer may use and/or apply video fragment settings such as a group of pictures (GOP) size, a frame rate, and a frame duration when creating video fragment files from the captured screen events. The video fragment files may be stored in a storage device such as the local disk on the computer. The video fragment files may be queued in the storage device. Each video fragment file in the queue may relayed or uploaded independently over the communication network, typically using a Hypertext Transfer Protocol Secure (HTTPS) protocol, to a location in a server storage device on the server (e.g., distributed cloud storage).

In response to a request sent to the client computer to stop recording the screen events on the client computer, the client computer may send a notification to the server that all of the plurality of video fragment files in the queue were relayed to the server over the communication network. The server may merge the video fragment files into a single video recording file on the server and subsequently stored on the server for playback.

In some embodiments of the present invention, the server may use a serverless computing service, such as Amazon Web Services (AWS) Lambda, for example, for managing the received video fragment files and merging them into a single recording file. Serverless computing may include a cloud-computing execution model in which the cloud provider may dynamically manage the allocation of machine resources. AWS Lambda services, for example, may allow a customer to run code without provisioning or managing servers. The customer may pay only for the computing time.

In some embodiments of the present invention, if the client computer detects a backlog in the queue of video fragment files to be sent to the server over the communication network, for example, to due to network latency or other impairments, smart algorithms may be used to dynamically change the video fragment file settings to reduce the backlog in the queue.

Thus, the embodiments taught herein overcome the technical problems seen in NAT and WebRTC based real time streaming solutions where the captured screen events on the client computer are streamed over the communication network to the server and the recording is created on the server side. In contrast to the embodiments taught herein, all recordings are created on the client computer as video fragment files in response to a start recording request. Each of the video fragment files may be individually uploaded to the server. The server may be used to merge the video fragment files to a single recording file. Thus, there is no loss of the beginning of the recording due to network latency. The video may exhibit higher video quality from start to finish with a smooth playback seek as well as faster forward and backward effects.

Furthermore, the embodiments taught herein provide a distributed recording solution where a network disconnection, for example, may have little or no effect. In that storage and/or merging of the video fragment files on the server may use a cloud serverless computing architecture, the overall recording solution may be more cost effective.

FIG. 1 schematically illustrates a block diagram of a system 10 for creating a fragmented video recording of a screen using a cloud-based serverless architecture, in accordance with some embodiments of the present invention. System 10 may include one or more client computers 15 (e.g., at least one client computer) communicating with a server 20 via a communication network 50. Each client computer 15 may include a processor 25 coupled to a memory 33, a storage device 47, an input device 45, an output device 35, and communication interface circuitry 40 for communicating with communication network 50. Similarly, server 20 may include a server processor 60 coupled to a memory 65, a storage device 90, an input device 85, an output device 75, and a communication interface circuitry 80 for communicating with communication network 50.

Although many of the figures shown hereinbelow may relate to a recording of screen events into a plurality of video fragment files on one client computer that are relayed to server 20 for merging into a single video recording file, for example, this is not by way of limitation of the embodiments taught herein. Any number of users using any respective number of the one or more client computers 15 as shown in FIG. 1 may have screen events recorded to video fragment files. The video fragment files from each client computer may be relayed to server 20 for merging into a respective recording file for playback by a supervisor or manager, for example. Thus, an interaction manager service 70 operating on processor 70 on server 20 may be used to manage the recording of screen events of multiple client computers 15 (e.g., on multiple agent desktops) and to merge the video fragment files from each of the client computers into a single recording file.

Client processor 25 and server processor 60 may include one or more processing units, e.g. of one or more computers. Processors 25 and 60 may be configured to operate in accordance with programmed instructions stored in memories 33 and 65 and/or storage devices 47 and 90. Client processor 25 may be capable of executing a screen agent application 30 for recording events at the computer terminal, screen and/or display of a customer agent desktop computer terminal at a customer call center. Similarly, server processor 60 may be capable of executing interaction manager service 70 for managing and merging the video fragment files received over communication network from each of client computers 15 as well as implementing the serverless computing services for implementing these functions.

Client processor 25 may communicate with output device 35. For example, output device 35 may include a computer monitor or screen on which screen agent 30 may capture the screen events and record them into a plurality of video fragment files. Output device 35 may include a printer, display panel, speaker, or another device capable of producing visible, audible, or tactile output.

Similarly, server processor 60 may communicate with a screen of output device 75 to display an graphic user interfaces for interaction manager service 70 for the user (e.g., manager or quality control users) of the system 10 to manage the recording of the screen events on each of the one or more client computers 15. Output device 75 may include a printer, display panel, speaker, or another device capable of producing visible, audible, or tactile output.

Processor 25 may communicate with input device 45. For example, input device 45 may include one or more of a keyboard, keypad, or pointing device for enabling a user to inputting data or instructions for operation of processor 25, which may be captured as screen events. Input device 45 may also include a microphone capturing the audio signals of a voice of the client computer user (e.g., an agent's voice during a customer call), which may be also included in the captured screen events.

Similarly, server processor 60 may communicate with server input device 85. For example, server input device 85 may include one or more of a keyboard, keypad, or pointing device for enabling a user of system 10 (e.g., supervisor or quality control managers) for inputting data or instructions for operation of processor 60 (e.g., via interaction service manager 70).

Processors 25 and 60 may communicate with respectively with memories 33 and 65. Memories 33 and 65 may each include one or more volatile or nonvolatile memory devices. Memories 33 and 65 may be utilized to store, for example, programmed instructions for operation of processors 25 and 60, respectively, data or parameters for use by processors 25 and 60 during operation, or results of operations of processors 25 and 60.

Processors 25 and 60 may communicate with respectively with data storage devices 47 and 90. Data storage devices 47 and 90 may each include one or more fixed or removable nonvolatile data storage devices such as a local disk. For example, data storage devices 47 and 90 may include a computer readable medium for storing program instructions for operation of processors 25 and 60, respectively. It is noted that data storage devices 47 and 90 may be remote from their respective processors 25 and 60. Data storage devices 47 and 90 may be utilized to store data or parameters for use by respective processors 25 and 60 during operation, or results of operation of processors 25 and 60, respectively.

In operation, processors 25 and 60 may execute a method for creating a fragmented video recording of screen events on one or more client computer 15 using a cloud serverless architecture implemented on server 20. Processor 25 may implement screen agent 30 to manage the creation of video fragment files in accordance with the video fragment settings as well as to upload them to server 20 (e.g., a cloud server) over communication network 50. Processor 60 may execute serverless computing processes such as AWS lambda for receiving the video fragment files over communication network 50, for storing them in a location in storage 90, and for merging them into a single recording file for use by a user of system 20 (e.g., a supervisor).

The following FIGS. 2-11 hereinbelow are flow diagrams for illustrating each step in a method for creating a fragmented video recording of screen events on client computer 15 (e.g., on the client computer side), which may relayed to server 20. Server 20 may implement a cloud serverless architecture, (e.g., AWS Lambda) for managing the received video fragment files and merging them into a single recording file on the server side.

FIG. 2 schematically illustrates a screen agent registration 95 to a cloud computing service (CCS) 100, in accordance with some embodiments of the present invention. CCS 100, such as Amazon Web Services (AWS), may include interaction manager service 105 and a cloud storage service 110, such as Amazon Web Services/S3, for example, executed by server processor 60.

An agent at client computer 15 executing a screen agent client 120 (e.g., screen agent 30 on client computer 15 in FIG. 1) in a call center 115 (e.g., on premise customer) may log 130 into CCS 100, which initiates a new publication or subscription service 125 specific to screen agent 120. Subscription service 125 may be a cloud signaling service using Websocket, HTTP Streaming, or HTTP long pooling. Through this channel signaling (e.g., signaling services 155 shown in FIG. 3 below), screen agent client 120 may receive control commands such as StartRecord, StopRecord commands, for example. This signaling channel may use communication network 15 (e.g., used for uploading video fragment files using HTTPS) or any another suitable communication network separate from communication network 15. The agent may initiate a call with a customer.

FIG. 3 schematically illustrates a flow diagram 140 to start recording screen events on a hosted client 150, in accordance with some embodiments of the present invention. When the agent initiates a call with a customer and registers with new publication or subscription service 125 as shown in FIG. 2, interaction manager service 105 may decide to issue a recording command 170. The decision to issue recording command 170 may be based on a supervisor or Quality Management staff manually entering the command, or automatically in accordance with some predefined criteria monitored by interaction manager service 105. Recording command 170 may relayed to a cloud computing application service 160. In response, a new cloud computing application service 160, such as AWS Lambda, may be established to provide a serverless Screen Recorder Service to manage the video fragment files specifically for hosted client 150 that logged in.

FIG. 4 schematically illustrates a flow diagram 180 to relay a recording request 185 to hosted client 150 using signaling services 155, in accordance with some embodiments of the present invention. Cloud computing application service 160 (e.g., serverless Screen Recorder Service) may relay a recording request 185A in response to recording command 170 to signaling services 155, which may subsequently relay recording request 185B to hosted client 150. In some embodiments, signaling services 155 may be implemented by a signal server (not shown), which is a network component that sets up a signaling gateway for transferring signal messages.

In some embodiments of the present invention, recording request 185 may include a tenant ID, a target bucket name, Security Token Service (STS) keys, a recording ID, and a request type.

In some embodiments of the present invention, the recording command 185 may include as shown in a parameter block 190: a target upload location, such as a location of screen recording buckets 165 and temporary credentials (AWS STS) to allow a PUT request. With these recording command parameters, screen agent 30 may upload the captured screen data in the recorded video fragment files to a Distributed Cloud Storage (e.g., cloud storage service 110).

FIG. 5 schematically illustrates video fragment files in a queue 195, in accordance with some embodiments of the present invention. In response to screen agent 30 receiving recording request 185, screen agent 30 of hosted client 150 may capture screen events or activity. The captured screen events may be stored a standard container such as MP4, or WebM, for example, to form video fragment files denoted individually as 1 and 2 (e.g., Container 1 and Container 2) in queue 195 in FIG. 5. The captured screen events may be stored in containers at predefined intervals such as every X seconds, for example, where X is any number. After X seconds, screen agent 30 may finalize the container file and start to create a new container file. X may or may not be the same for all video fragment files and may be varied dynamically with changes in the video fragment settings.

FIG. 6 schematically illustrates a flow diagram 202 for uploading video fragment files in queue 195 to screen recording bucket 165, in accordance with some embodiments of the present invention. As the video fragment files are formed, the video fragment files denoted 1.mp4, 2.mp4 . . . N.mp4 are uploaded 200A from queue 195 over communication network 50 using an HTTPS protocol to screen recording bucket 165 (e.g., the target upload location in distributed cloud storage). The uploaded video fragment files in screen recording bucket 165 are shown in FIG. 6 as MP4 files 200. Also shown in FIG. 6, queue 195 is empty. In some embodiments, each of the uploaded files may include a tenant ID, a recording ID, and an MP4 file.

FIG. 7 schematically illustrates a flow diagram 204 for issuing a stop recording request 210 to hosted client 150 at client computer 15, in accordance with some embodiments of the present invention. On or before the end of the agent's call, interaction manager service 105 may issue a stop recording command 205 to cloud computing application 160. In response, cloud computing application 160 issues a stop recording request 210A which may be relayed to signaling services 155, which then further relays stop recording request 210B to screen agent 30 of hosted client 150. Stop recording request 210 may include a location (e.g., target queue on server 20) of screen recordings bucket 165 and temporary credentials (AWS STS) to allow PUT messages into the target queue of screen recordings bucket 165. Simple Queue Service (SQS) 207 may monitor the relaying of video fragment files to screen recordings bucket 165. Using these parameters, screen agent 30 may notify SQS 207 about screen event capturing and upload completion.

FIG. 8 schematically illustrates a flow diagram 215 for uploading an end notification 230 to SQS 207, in accordance with some embodiments of the present invention. When the last media container (e.g., video fragment file) had been uploaded from the client computer to screen recording bucket 165, a client notification message including a recording ID number 225 notifying SQS 207 that all of the video fragment files have been uploaded to Target Cloud Queue (e.g., screen recording buckets 165).

In some embodiments of the present invention, upload end notification 230 may include a tenant ID, a recording ID, a number of files, and a metadata file.

FIG. 9 schematically illustrates a flow diagram 232 for cloud computing application 160 to start a video recording by issuing a POP message 230 to signaling services 155, in accordance with some embodiments of the present invention. For completion of the recording flow, cloud computing application 160 may execute a Serverless Post Recording Service, and may issue pop message 230 to signaling services 155.

FIG. 10 schematically illustrates a flow diagram 242 for merging video fragment files 200, in accordance with some embodiments of the present invention. All video recording fragments files 200 may be downloaded from the Target Upload Location (e.g., screen recordings bucket 165 in cloud storage service 110) by cloud computing application 160 (e.g., AWS Lambda) and may be merged as denoted by an arrow 255 into a single standard media file (e.g., single recording file).

FIG. 11 schematically illustrates a flow diagram 250 for storing a single recording file 260 created by merging video fragment files 200 in cloud storage service 110, in accordance with some embodiments of the present invention. Single recording file 260 denoted Complete.mp4 may be subsequently uploaded to cloud storage service 110 for later playback.

FIG. 12 schematically illustrates an architecture 300 for managing a fragmented recording of events on a screen of a client computer by a server, in accordance with some embodiments of the present invention. Architecture diagram 300 shows how the solution for creating a fragmented video recording of a screen using a cloud-based serverless architecture fits within a product framework. The recording flow may be the initial step for many contact center solution services, based on automatic call distributor (ACD) 301 events, provided by a product framework such as Quality Management (QM) 325, Analytics 330, Work Force Management (WFM) 335, and Playback 340. Management services 310 may operate the recorders (e.g., screen recording 315 and audio recording 320) based on ACD 301 events, and may store the metadata to serve the product framework by using a search 350 and elastic search services 355 as shown in a block 345.

In some embodiments of the present invention, screen agent client 30 may include a smart upload algorithm to dynamically adjust the video fragment settings during the creation of the video fragment files with the captured screen events. The smart algorithm may be configured to ensure a complete and high quality video recording of screen events, a minimal client footprint (e.g., low CPU loading, low memory loading, low disk space usage, and little network throughput loading), no backlogs in the server storage location (e.g., screen recording bucket 165) or client computer queue (e.g., queue 195), a responsive seek (forward and backward) function during playback of complete video recording file 260, and minimal computational and storage costs in using the serverless computing application.

In some embodiments of the present invention, there may be tradeoffs when adjusting the various video fragment settings of OOP size, file duration, and frame rate. For example, (1) increasing the GOP size may reduce the recorded fragment file size (e.g., larger GOP size results less Iframes) and may reduce network throughput. However, increasing the GOP size may also result in a poor seek performance during playback.

(2) Decreasing the GOP size may enable a better seek performance during playback since a smaller GOP results in more Iframes. However, decreasing the GOP size may also result in a higher network throughput and a backlog in queue 195 of fragment files to be uploaded due to the larger video fragment file sizes.

(3) Decreasing file duration may result in better seek performance during playback since each fragment file starts with an Iframe and more fragment files for lower file duration results in more Iframes. However, decreasing the file duration results in more fragment files to be uploaded. This may result in higher serverless computing service pricing for merging the increased number of files on the server side, a backlog of files to be merged in the server side, and a higher storage price due to a higher number of HTTP upload (PUT) requests. Note that if a backlog occurs due to many video recordings in the server side, this may trigger new cloud computing application 160, such as AWS Lambda to be established increasing the pricing.

(4) Increasing the file duration may decrease network throughput since increasing the file duration may subsequently result in less fragment files and less Iframes. This may also decrease the number of files to be merged on the server side. However, increasing the file duration may result in the need for larger client disk space as well as increasing the risk of losing larger portions of the recording in the event of network or system failure.

(5) Reducing frame rate to reduce file size may result in a decrease in network throughput since there are less fragment files and less Iframes. However reducing the frame rate may result in poor seek performance during playback and reduced video quality.

In some embodiments of the present invention, nominal video fragment settings may be initially used and adjusted dynamically as impairments in the network quality and/or video quality may be detected. The nominal video fragment settings (e.g., default settings) may include:

-   -   File Duration—60 seconds (Iframe per file)     -   Group of Pictures—400 (for reasonably responsive seek function)     -   Average File Size—2 MB     -   Average file upload time—6 seconds (e.g., to client side, no         backlog)     -   Process CPU—˜20%     -   Average Merge time—0.5 second per file on server side, no         backlog     -   Video Visual Quality—Checked

In some embodiments of the present invention, a backlog of video fragment files in queue 195 may be defined where the number of video fragment files in queue 195 exceeds a predefined number, such as 2 video fragment files, for example. When a backlog is detected by client processor 25 (e.g., by screen agent 30), different actions may be taken to manage the backlog.

FIG. 13 is a flowchart illustrating a method 450 for managing a backlog of video fragment files in queue 195, in accordance with some embodiments of the present invention. Method 450 may be performed by screen agent 30 executed by client processor 25 on client computer 15.

Method 450 may include monitoring 455 queue 195 of fragment files to be uploaded to screen recording bucket 165.

Method 450 may include a decision step 460 where processor 25 assesses if there is a backlog in queue 195. If not, processor continues to monitor 455 the queue. If there is a backlog, then processor 25 may assess whether the files in the backlog exceeds a predetermined size in a decision step 165. If so, processor 25 stops 470 the recording of the screen events as an error condition exists. In some embodiments, the already-created video fragment files may be deleted.

If not, processor 25 may then continue to assess if the video fragment files exceed a duration limit in a decision step 475. If not, processor 25 may adjust the video fragment settings, so as to increase 480 the file size, and processor 25 continues to monitor 455 the queue.

If the fragment files do exceed a file duration limit, processor 25 then assesses if the fragment files excess a group of pictures limit in a decision step 485. If so, processor 25 may stop 470 the recording. If not, processor 25 may adjust the video fragment settings so as to increase 480 the file size, and processor 25 then continues to monitor 455 the queue.

In some embodiments of the present invention, the video fragment files may be encrypted, relayed over the communication network to the server, and decrypted and merged in cloud computing application 160 (e.g., in AWS Lambda, for example).

FIG. 14 is a graph 550 of merge time versus a number of video fragment files, in accordance with some embodiments of the present invention. Graph 550 illustrates that the measured merge time may be substantially linear with the number of files to merge. Graph 550 illustrates on the horizontal axis the number of files to be merged where each file had an average size of 900 KB and an average duration of 8 seconds. Marker 560 illustrates the merge time and the number of recorded video fragment files for a 5 minute call, marker 565 for a 10 minute call, marker 570 for a 15 minute call.

In the Tables below, the CPU usage was found empirically to remain substantially the same at about 30% for all configurations shown in the Tables as follows:

TABLE I Measured Merge Time of the same number of file (10 files) of different sizes AVG. FILE AVG. FILE MERGE SIZE DURATION TIME 5000 45 1.82 1600 19 1.992 600 7 2.002 300 3 1.791

TABLE II Measured Merge Time for a different number of files with different sizes for a 5 minute recording NUMBER AVG. FILE AVG. FILE MERGE OF FILES SIZE DURATION TIME 6 5000 45 1.166 15 1600 19 2.366 40 800 7 3.049 61 600 5 4.733 95 400 3 6.633

TABLE III Measured Merge Time for a different number of files with different sizes for a 10 minute recording NUMBER AVG. FILE AVG. FILE MERGE OF FILES SIZE DURATION TIME 12 5000 45 2.186 30 1600 19 3.223 80 800 7 6.773 122 600 5 8.77 190 400 3 13.007

Measurements were taken to determine the optimal GOP, file size, and recording length to determine the optimal upload times as shown in Table IV below:

TABLE IV Measured Data for Determining Optimal Upload Time Segment File Upload GOP Frame Length Length File Time Upload size Rate Parameter (sec) Size (sec) Rate 400 15 400/15 = 26  335 KB 6.5  51 KB/sec 26 43  555 KB 3.4 163 KB/sec 26  343 KB 2.6 131 KB/sec 25  382 KB 6.8  56 KB/sec 18  699 KB 3.2 218 KB/sec 400 15  80 82 5,280 KB 7.9 668 KB/sec 78 1,825 KB 4.5 405 KB/sec 85 2,032 KB 4.3 472 KB/sec 405 15 108 109 2,622 KB 7.2 364 KB/sec 109 1,986 KB 4.2 472 KB/sec 119 2,482 KB 4.7 528 KB/sec 104 3,557 KB 4.8 741 KB/sec 100 1,827 KB 4.3 424 KB/sec 128 2,079 KB 15.8 131 KB/sec 378 27 378/27 = 18  346 KB 7.1  48 KB/sec 14 10  216 KB 2.4  90 KB/sec 18  714 KB 3.2 223 KB/sec 14  284 KB 2.5 113 KB/sec 8  100 KB 2.1  47 KB/sec 21  924 KB 3.4 271 KB/sec 12  262 KB 2.5 104 KB/sec 10  332 KB 2.9 114 KB/sec 20  599 KB 3.1 193 KB/sec 6  158 KB 2.4  65 KB/sec 14  613 KB 3 204 KB/sec 450 15 120 122 2,066 KB 9.4 219 KB/sec 145 2,110 KB 5.3 398 KB/sec 103 1,524 KB 3.7 411 KB/sec 113 1,468 KB 4.1 358 KB/sec 131 1,840 KB 4.6 400 KB/sec 450 15  60 70 2,144 KB 7 306 KB/sec 57 1,163 KB 3.8 306 KB/sec 60 3,091 KB 9.7 318 KB/sec 81 2,582 KB 4.8 537 KB/sec 46 1,156 KB 3.6 321 KB/sec 60  911 KB 8 113 KB/sec 44 1,004 KB 3.5 286 KB/sec 59 3,844 KB 5 768 KB/sec

FIG. 15 is a flowchart of a method 600 for creating a fragmented recording of events on a screen of a computer, in accordance with some embodiments of the present invention. Method 600 may be performed by client processor 25 on client computer 15.

Method 600 may include receiving 605 a start recording request to record events on a screen (e.g., output device 35) of a client computer 15 communicating with server 20 over communication network 50.

Method 600 may include capturing 610 the events on the screen.

Method 600 may include storing 615 the captured screen events in a plurality of video fragment files in a queue in storage 47 (e.g., a local disk) in client computer 15 using video fragment settings. The video fragment settings applied while storing 615 the captured screen events into the plurality of video fragment files may include but not limited to the video frame rate, the group of pictures (GOP) size, and/or video fragment file duration, for example.

Method 600 may include sending 620 the plurality of video fragment files in the queue over the communication network to storage 90 in server 20.

Method 600 may include dynamically changing 625 the video fragment settings upon detecting a backlog in the queue of the plurality of the video fragment files in the storage.

Method 600 may include sending 630 a notification to the server that the plurality of video fragment files in the queue were relayed to the server over the communication network in response to receiving a stop recording request from the server.

FIG. 16 is a flowchart of a method 650 for managing a fragmented recording of events on a screen of a client computer by a server, in accordance with some embodiments of the present invention. Method 650 may be performed by server processor 60 on server 20.

Method 650 may include sending 655 to client computer 15 a start recording request to record events on screen 35 of client computer 15.

Method 650 may include receiving 660 a plurality of video fragment files for storage in a location (e.g., screen recording bucket 165) of server storage 90, where each of the video fragment files include screen events captured on screen 35 of client computer 15.

Method 650 may include sending 665 a stop recording request to client computer 15.

Method 650 may include receiving 670 a notification that all of the plurality of video fragment files were relayed from client computer 15 over communication network 50 to server 20.

Method 650 may include merging 675 the plurality of video fragment files into a single video recording file in response to the received notification.

In some embodiments of the present invention, a computerized method for creating a fragmented recording of events on a screen of a computer may include a processor of a computer configured to execute code for recording events on a screen of the computer. A start recording request may be received by the processor to record the screen events from a server communicating with the computer over a communication network. The events on the screen may be captured by the processor. The captured screen events may be stored by the processor into a storage of the computer in a plurality of video fragment files in a queue in the storage by applying video fragment settings. The plurality of video fragment files in the queue may be sent by the processor over the communication network to the server. The video fragment settings may be dynamically changed upon detecting a backlog in the queue of the plurality of video fragment files in the storage. In response to receiving a stop recording request from the server, a notification may be sent by the processor to the server that all of the plurality of video fragment files in the queue were relayed to the server over the communication network for the server to merge the plurality of video fragment files into a single video recording file.

In some embodiments of the present invention, sending the plurality of video fragments may include uploading the plurality of video fragment files to a distributed cloud storage.

In some embodiments of the present invention, detecting the backlog in the queue may include detecting that a number of the plurality of video fragment files exceeds a predefined number.

In some embodiments of the present invention, dynamically changing the video fragment settings may include changing a group of pictures (GOP) size in the plurality of video fragment files for managing the backlog.

In some embodiments of the present invention, dynamically changing the video fragment settings may include changing a frame rate of the plurality of video fragment files for managing the backlog.

In some embodiments of the present invention, dynamically changing the video fragment settings may include changing a frame duration of the plurality of video fragment files for managing the backlog.

In some embodiments of the present invention, the captured screen events may be selected from the group consisting of: audio signals captured by an input device, audio signals captured through an output device, videos captured on the screen, and images captured on the screen.

In some embodiments of the present invention, the communication network may use a Hypertext Transfer Protocol Secure (HTTPS) protocol for sending the plurality of video fragment files to the server.

In some embodiments of the present invention, a client computer for creating a fragmented recording of events on a screen of the computer may include a storage, a screen, and a processor. The processor may be configured to execute code for recording events on the screen, to receive a start recording request to record the screen events from a server communicating with the computer over a communication network, to capturing the events on the screen, to store the captured screen events into a plurality of video fragment files in a queue in the storage by applying video fragment settings, to send the plurality of video fragment files in the queue over the communication network to the server, to dynamically change the video fragment settings upon detecting a backlog in the queue of the plurality of video fragment files in the storage, and in response to receiving a stop recording request from the server, to send a notification to the server that all of the plurality of video fragment files in the queue were relayed to the server over the communication network for the server to merge the plurality of video fragment files into a single video recording file.

In some embodiments of the present invention, the storage may include a local disk on the client computer.

In some embodiments of the present invention, the code for recording the events on the screen may include a screen agent client software.

In some embodiments of the present invention, the plurality of video fragment files may include a plurality of standard containers.

In some embodiments of the present invention, the processor may be configured to detect the backlog in the queue by detecting that a number of the plurality of video fragment files exceeds a predefined number.

In some embodiments of the present invention, the processor may be configured to dynamically change the video fragment settings by changing a group of pictures (GOP) size in the plurality of video fragment files for managing the backlog.

In some embodiments of the present invention, the processor may be configured to dynamically change the video fragment settings by changing a frame rate of the plurality of video fragment files for managing the backlog.

In some embodiments of the present invention, the processor may be configured to dynamically change the video fragment settings by changing a frame duration of the plurality of video fragment files for managing the backlog.

In some embodiments of the present invention, the communication network may use a Hypertext Transfer Protocol Secure (HTTPS) protocol to send the plurality of video fragment files to the server.

In some embodiments of the present invention, a computerized method for managing a fragmented recording of events on a screen of a client computer in a computerized system by a server communicating with a client computer over a communication network, sending by a processor to the client computer a start recording request to record events on a screen of the client computer. A plurality of video fragment files may be received by the processor for storage in a location of a storage in the server, wherein each of the video fragment files include screen events captured on the screen of the client computer. A stop recording request may be sent to the client computer. A notification may be received by the processor that all of the plurality of video fragment files were relayed from the client computer over the communication network. In response to the received notification, the plurality of video fragment files may be merged by the processor into a single video recording file.

In some embodiments of the present invention, the processor may execute a serverless computing service for managing the plurality of video fragment files to create the single video recording file.

It should be understood with respect to any flowchart referenced herein that the division of the illustrated method into discrete operations represented by blocks of the flowchart has been selected for convenience and clarity only. Alternative division of the illustrated method into discrete operations is possible with equivalent results. Such alternative division of the illustrated method into discrete operations should be understood as representing other embodiments of the illustrated method.

Similarly, it should be understood that, unless indicated otherwise, the illustrated order of execution of the operations represented by blocks of any flowchart referenced herein has been selected for convenience and clarity only. Operations of the illustrated method may be executed in an alternative order, or concurrently, with equivalent results. Such reordering of operations of the illustrated method should be understood as representing other embodiments of the illustrated method.

Different embodiments are disclosed herein. Features of certain embodiments may be combined with features of other embodiments; thus certain embodiments may be combinations of features of multiple embodiments. The foregoing description of the embodiments of the invention has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. It should be appreciated by persons skilled in the art that many modifications, variations, substitutions, changes, and equivalents are possible in light of the above teaching. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention.

While certain features of the invention have been illustrated and described herein, many modifications, substitutions, changes, and equivalents will now occur to those of ordinary skill in the art. It is, therefore, to be understood that the appended claims are intended to cover all such modifications and changes as fall within the true spirit of the invention. 

The invention claimed is:
 1. A computerized method for cloud-based video recording of events on a screen of a client computer, where the recording is created on the client computer in response to a start recording request from a server of the cloud-based video recording, the method comprising: in a processor of a client computer configured to execute code for recording events on a screen of the client computer, receiving a start recording request to record the screen events from a server of a cloud-based video recording that is communicating with the client computer over a communication network; capturing by the processor, the events on the screen of the client computer; storing by the processor in a storage of the client computer, the captured screen events into a plurality of video fragment files in a queue in the storage, using video fragment settings; uploading the plurality of video fragment files in the queue to the server of the cloud-based video recording by sending by the processor, the plurality of video fragment files in the queue over the communication network to the server of the cloud-based video recording; dynamically changing the video fragment settings of the plurality of video fragment files, upon detecting a backlog in the queue of the plurality of video fragment files in the storage; and in response to receiving a stop recording request from the server of the cloud-based video recording, sending by the processor a notification to the server of the cloud-based video recording that all of the plurality of video fragment files in the queue were relayed to the server of the cloud-based video recording, over the communication network for the server of the cloud-based video recording to merge the plurality of video fragment files into a single video recording file, wherein the dynamically changing of the video fragment settings of the plurality of video fragment files, upon detecting a backlog in the queue, is performed to reduce network throughput while avoiding poor seek performance and reduced video quality of the single video recording file.
 2. The method according to claim 1, wherein sending the plurality of video fragments comprises uploading the plurality of video fragment files to a distributed cloud storage in the server of the cloud-based video recording.
 3. The method according to claim 1, wherein detecting the backlog in the queue comprises detecting that a number of the plurality of video fragment files exceeds a predefined number.
 4. The method according to claim 1, wherein dynamically changing the video fragment settings comprises increasing a group of pictures (GOP) size in the plurality of video fragment files for managing the backlog.
 5. The method according to claim 1, wherein dynamically changing the video fragment settings comprises reducing a frame rate of the plurality of video fragment files for managing the backlog.
 6. The method according to claim 1, wherein dynamically changing the video fragment settings comprises increasing a file duration of the plurality of video fragment files for managing the backlog.
 7. The method according to claim 1, wherein the captured screen events are selected from the group consisting of: audio signals captured by an input device, audio signals captured through an output device, videos captured on the screen of the client computer, and images captured on the screen of the client computer.
 8. The method according to claim 1, wherein the communication network uses a Hypertext Transfer Protocol Secure (HTTPS) protocol for sending the plurality of video fragment files from the client computer to the server of the cloud-based video recording.
 9. A client computer for cloud-based video recording of events on a screen of the client computer, where the recording is created on the client computer in response to a start recording request from a server of the cloud-based video recording, the client computer comprising: a storage; a screen; and a processor configured to execute code for recording events on the screen of the client computer, to receive a start recording request to record the screen events from a server of the cloud-based video recording communicating with the client computer over a communication network, to capturing the events on the screen of the client computer, to store the captured screen events into a plurality of video fragment files in a queue in the storage using video fragment settings, to send the plurality of video fragment files in the queue over the communication network to the server of the cloud-based video recording, to dynamically change the video fragment settings of the plurality of video fragment files, upon detecting a backlog in the queue of the plurality of video fragment files in the storage, and in response to receiving a stop recording request from the server of the cloud-based video recording, to send a notification to the server of the cloud-based video recording that all of the plurality of video fragment files in the queue were relayed to the server of the cloud-based video recording over the communication network for the server of the cloud-based video recording to merge the plurality of video fragment files into a single video recording file, wherein the dynamically change of the video fragment settings of the plurality of video fragment files, upon detecting a backlog in the queue, is performed to reduce network throughput while avoiding poor seek performance and reduced video quality of the single video recording file.
 10. The client computer according to claim 9, wherein the storage comprises a local disk on the client computer.
 11. The client computer according to claim 9, wherein the code for recording the events on the screen may include a screen agent client software.
 12. The client computer according to claim 9, wherein the plurality of video fragment files comprises a plurality of standard containers.
 13. The client computer according to claim 9, wherein the processor is configured to detect the backlog in the queue by detecting that a number of the plurality of video fragment files exceeds a predefined number.
 14. The client computer according to claim 9, wherein the processor is configured to dynamically change the video fragment settings of the plurality of video fragment files, by increasing a group of pictures (GOP) size in the plurality of video fragment files for managing the backlog.
 15. The client computer according to claim 9, wherein the processor is configured to dynamically change the video fragment settings of the plurality of video fragment files, by reducing a frame rate of the plurality of video fragment files for managing the backlog.
 16. The client computer according to claim 9, wherein the processor is configured to dynamically change the video fragment settings by increasing a file duration of the plurality of video fragment files for managing the backlog.
 17. The client computer according to claim 9, wherein the captured screen events are selected from the group consisting of: audio signals captured by an input device, audio signals captured through an output device, videos captured on the screen of the client computer, and images captured on the screen of the client computer.
 18. The client computer according to claim 9, wherein the communication network uses a Hypertext Transfer Protocol Secure (HTTPS) protocol to send the plurality of video fragment files to the server of the cloud-based video recording.
 19. A computerized method for managing a fragmented recording of events on a screen of a client computer in a computerized system by a server of a cloud-based video recording, the computerized method comprising: in a processor of a server of a cloud-based video recording communicating with a client computer over a communication network, sending by the processor to the client computer a start recording request to record events on a screen of the client computer; receiving by the processor, a plurality of video fragment files for storage in a location of a storage in the server, wherein each of the video fragment files includes screen events captured on the screen of the client computer; sending by the processor a stop recording request to the client computer; receiving by the processor a notification that all of the plurality of video fragment files has been relayed from the client computer over the communication network; and in response to the received notification, merging by the processor the plurality of video fragment files into a single video recording file, wherein settings of the plurality of video fragment files have been dynamically changed in the client computer, upon detecting a backlog in a queue in the client computer, to reduce network throughput while avoiding poor seek performance and reduced video quality of the single video recording file.
 20. The method according to claim 19, wherein the processor executes a serverless computing service for managing the plurality of video fragment files to create the single video recording file. 